03. Visualizing Data Lineage

03 Visualizing Data Lineage-

Data Lineage

What is the data lineage of a dataset?

SOLUTION: Description of the discrete steps involved in the creation, movement, and calculation of that dataset

Data lineage benefits

Which of the following are benefits of visualizing data lineage?

SOLUTION:
  • Builds confidence in our users that our data pipelines are designed properly
  • Helps organizations surface and agree on dataset definitions
  • Makes locating errors more obvious

Data Lineage in Airflow

Which components of Airflow can be used to track data lineage?

SOLUTION:
  • Rendered code tab for a task
  • Graph view for a DAG
  • Historical runs under the tree view